A Speeding K-means Clustering Method Based on Sampling

doi:10.3969/j.issn.1006-2475.2013.12.007

Computer and Modernization, 2013, Vol. 12, Issue 12: 27-29. doi: 10.3969/j.issn.1006-2475.2013.12.007

WANG Xiu-hua

Received: 2013-09-17 Revised: 1900-01-01 Published: 2013-12-18 Online: 2013-12-18

Abstract

Abstract: The traditional K-means clustering algorithm cannot handle the clustering of large-scale datasets. To address this problem, this paper presents an accelerated K-means clustering method based on random sampling, called the Kmeans_RS clustering algorithm. A working set is selected from the original dataset by random sampling, and the traditional K-means clustering method is executed on this working set. The center and radius of every resulting cluster are then computed, which gives the sampling result. The final clustering of the whole dataset is obtained by measuring the relationship between the sampling result and the remaining data, and assigning the remaining data accordingly. Because random sampling reduces the size of the problem that K-means has to solve, clustering efficiency improves substantially, and the method can be applied to large-scale clustering problems. Simulation results demonstrate that the proposed method achieves high clustering efficiency.
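The steps described in the abstract can be sketched as follows. This is a minimal illustration, not the author's implementation: the abstract does not say how the radius is defined or how the relationship between the sampling result and the remaining data is measured. The sketch assumes that the radius is the largest member-to-center distance within the working set and that each remaining point is assigned to its nearest center. The names `kmeans`, `kmeans_rs`, `sample_size`, and `seed` are hypothetical.

```python
import math
import random


def nearest(p, centers):
    """Index of the center closest to point p (Euclidean distance)."""
    return min(range(len(centers)), key=lambda j: math.dist(p, centers[j]))


def kmeans(points, k, iters=50, seed=0):
    """Plain Lloyd's K-means on a list of tuples; returns (centers, labels)."""
    rng = random.Random(seed)
    centers = rng.sample(points, k)  # initial centers: k random points
    for _ in range(iters):
        labels = [nearest(p, centers) for p in points]
        new_centers = []
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:
                # New center is the coordinate-wise mean of the members.
                new_centers.append(tuple(sum(c) / len(members) for c in zip(*members)))
            else:
                new_centers.append(centers[j])  # keep the center of an empty cluster
        if new_centers == centers:
            break
        centers = new_centers
    # Recompute labels so they always match the returned centers.
    return centers, [nearest(p, centers) for p in points]


def kmeans_rs(points, k, sample_size, seed=0):
    """Sketch of sampling-based K-means under the assumptions stated above."""
    rng = random.Random(seed)
    # 1. Select the working set by random sampling without replacement.
    working_set = [points[i] for i in rng.sample(range(len(points)), sample_size)]
    # 2. Run traditional K-means on the working set only.
    centers, sample_labels = kmeans(working_set, k, seed=seed)
    # 3. Radius per cluster: largest member-to-center distance (assumed definition).
    radii = [0.0] * k
    for p, lab in zip(working_set, sample_labels):
        radii[lab] = max(radii[lab], math.dist(p, centers[lab]))
    # 4. Assign every point to its nearest center (assumed rule). Sampled points
    #    keep the labels they received in step 2, since those are nearest-center too.
    labels = [nearest(p, centers) for p in points]
    return centers, radii, labels
```

In this sketch the iterative part of K-means only touches `sample_size` points, and the full dataset is scanned once in step 4, which is where the efficiency gain described in the abstract would come from. The radii are returned but not used for assignment, because the abstract does not specify their role.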


Key words: K-means clustering, random sampling, center, radius, working set, efficiency

WANG Xiu-hua. A Speeding K-means Clustering Method Based on Sampling[J]. Computer and Modernization, 2013, 12(12): 27-29.
